Constrained overlapping clusters: minimizing the negative effects of bridge-nodes
نویسندگان
چکیده
This paper presents a new approach to forming overlapping clusters of objects by balancing the effects of incompleteness, impurity and overlap. Incompleteness results from similar objects separated into different clusters while impurity arises when a cluster contains dissimilar objects. Overlap is caused by nodes that appear in more than one cluster. The key to balancing these effects is the identification of bridge-nodes. We show the limitations of traditional clustering algorithms in handling bridge nodes and demonstrate the intractability of minimizing all three effects. Approximation algorithms based on graph mincut and genetic algorithm are proposed to minimize these effects. Our results with real data sets show significant improvement over traditional methods with regard to incompleteness, impurity and overlap.
منابع مشابه
Clustering in the Presence of Bridge-Nodes
In this paper, we study the ill-effects of bridgenodes, which causes many dissimilar objects to be placed together in the same cluster by existing clustering algorithms. We offer two new metrics for measuring how well a clustering algorithm handles the presence of bridge-nodes. We also illustrate how algorithms that produce overlapping clusters help to alleviate the effect of bridge-nodes and f...
متن کاملIdentifying overlapping communities using multi-agent collective intelligence
The proposed algorithm in this research is based on the multi-agent particle swarm optimization as a collective intelligence due to the connection between several simple components which enables them to regulate their behavior and relationships with the rest of the group according to certain rules. As a result, self-organizing in collective activities can be seen. Community structure is crucial...
متن کاملTarget Detection Improvements in Hyperspectral Images by Adjusting Band Weights and Identifying end-members in Feature Space Clusters
Spectral target detection could be regarded as one of the strategic applications of hyperspectral data analysis. The presence of targets in an area smaller than a pixel’s ground coverage has led to the development of spectral un-mixing methods to detect these types of targets. Usually, in the spectral un-mixing algorithms, the similar weights have been assumed for spectral bands. Howe...
متن کاملInteractive, Constraint-based Layout of Engineering Diagrams
Many engineering disciplines require visualisation of networks. Constrained graph layout is a powerful new approach to network layout that allows the user to impose a wide variety application-specific placement constraints—such as downwards pointing directed edges, alignment of nodes, cluster containment and non-overlapping nodes and clusters—on the layout. We have recently developed an efficie...
متن کاملThe Overlapped K-hop (OK) Clustering Algorithm
Clustering is a standard approach for achieving efficient and scalable performance in wireless sensor networks. Clustering algorithms are mostly heuristic in nature and aim at generating the minimum number of disjoint clusters. In this report, we formulate the overlapping multi-hop clustering problem as an extension to the k-dominating set problem. Then we propose a fast, randomized, distribute...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistical Analysis and Data Mining
دوره 3 شماره
صفحات -
تاریخ انتشار 2010